SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: Soos, Jeanne M. 

Schif fenbauer, Joel 
Johnson, Howard M. 

(ii) TITLE OF INVENTION: Orally-Administered Interf eron-Tau 

Compositions and Methods 

(iii) NUMBER OF SEQUENCES: 6 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Dehlinger & Associates 

(B) STREET: 350 Cambridge Ave., Suite 250 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 94306 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 15-MAR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Sholtz, Charles K. 

(B) REGISTRATION NUMBER: 38,615 

(C) REFERENCE/DOCKET NUMBER: 5600-0003 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-324-0880 

(B) TELEFAX: 415-324-0960 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ovis aries 

(B) STRAIN: Domestic 

(D) DEVELOPMENTAL STAGE: Blastula (blastocyst) 

(F) TISSUE TYPE: Trophectoderm 

(G) CELL TYPE: Mononuclear trophectoderm cells 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: oTP-la 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..516 
(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Ott, Troy L 

Van Heeke, Gino 
Johnson, Howard M 
Bazer, Fuller W 

(B) TITLE: Cloning and Expression in Saccharoinyces 

cerevisiae of a Synthetic Gene for the Type I 
Trophoblast Interferon Ovine Trophoblast 
Protein-1 : Purification and Antiviral Activity 

(C) JOURNAL: J. Interferon Res. 

(D) VOLUME: 11 

(F) PAGES: 357-364 

(G) DATE: 1991 

(K) RELEVANT RESIDUES IN SEQ ID NO : 1 : FROM 1 TO 516 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TGC TAC CTG TCG CGA AAA CTG ATG CTG GAC GCT CGA GAA AAT TTA AAA 4 8 

Cys Tyr Leu Ser Arg Lys Leu Met Leu Asp Ala Arg Glu Asn Leu Lys 
5 1 5 10 15 

CTG CTG GAC CGT ATG AAT CGA TTG TCT CCG CAC AGC TGC CTG CAA GAC 96 

Leu Leu Asp Arg Met Asn Arg Leu Ser Pro His Ser Cys Leu Gin Asp 

20 25 30 

10 

CGG AAA GAC TTC GGT CTG CCG CAG GAA ATG GTT GAA GGT GAC CAA CTG 144 

Arg Lys Asp Phe Gly Leu Pro Gin Glu Met Val Glu Gly Asp Gin Leu 
35 40 45 

15 CAA AAA GAC CAA GCT TTC CCG GTA CTG TAT GAA ATG CTG CAG CAG TCT 192 
Gin Lys Asp Gin Ala Phe Pro Val Leu Tyr Glu Met Leu Gin Gin Ser 
50 55 60 

□ TTC AAC CTG TTC TAC ACT GAA CAT TCT TCG GCC GCT TGG GAC ACT ACT 24 0 

m Phe Asn Leu Phe Tyr Thr Glu His Ser Ser Ala Ala Trp Asp Thr Thr 
jg 65 70 75 80 

5 s ?: 

;;5 CTT CTA GAA CAA CTG TGC ACT GGT CTG CAA CAG CAA CTG GAC CAT CTG 288 

Leu Leu Glu Gin Leu Cys Thr Gly Leu Gin Gin Gin Leu Asp His Leu 

!#5 85 90 95 

I'M 

GAC ACT TGC CGT GGC CAG GTT ATG GGT GAA GAA GAC TCT GAA CTG GGT 336 
Asp Thr Cys Arg Gly Gin Val Met Gly Glu Glu Asp Ser Glu Leu Gly 
U 100 105 110 

30 

AAC ATG GAT CCG ATC GTT ACT GTT AAA AAA TAT TTC CAG GGT ATC TAC 384 
Asn Met Asp Pro lie Val Thr Val Lys Lys Tyr Phe Gin Gly lie Tyr 
115 120 125 

35 GAC TAC CTG CAG GAA AAA GGT TAC TCT GAC TGC GCT TGG GAA ATC GTA 432 
Asp Tyr Leu Gin Glu Lys Gly Tyr Ser Asp Cys Ala Trp Glu lie Val 
130 135 140 

CGC GTT GAA ATG ATG CGG GCC CTG ACT GTG TCG ACT ACT CTG CAA AAA 4 80 

4 0 Arg Val Glu Met Met Arg Ala Leu Thr Val Ser Thr Thr Leu Gin Lys 
145 150 155 160 

CGG TTA ACT AAA ATG GGT GGT GAC CTG AAT TCT CCG 516 
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Arg Leu Thr Lys Met Gly Gly Asp Leu Asn Ser Pro 
165 170 



(2) INFORMATION FOR SEQ ID NO: 2: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: protein 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: amino acid sequence of a mature 
15 OvIFNtau protein 

u (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

O 

O Cys Tyr Leu Ser Arg Lys Leu Met Leu Asp Ala Arg Glu Asn Leu Lys 

% 1 5 10 15 

rf>\ 

j4 Leu Leu Asp Arg Met Asn Arg Leu Ser Pro His Ser Cys Leu Gin Asp 
□ 20 25 30 

l$p Arg Lys Asp Phe Gly Leu Pro Gin Glu Met Val Glu Gly Asp Gin Leu 

•J-! 35 4 0 45 

Q Gin Lys Asp Gin Ala Phe Pro Val Leu Tyr Glu Met Leu Gin Gin Ser 
M 50 55 60 

30 

Phe Asn Leu Phe Tyr Thr Glu His Ser Ser Ala Ala Trp Asp Thr Thr 
65 70 75 80 

Leu Leu Glu Gin Leu Cys Thr Gly Leu Gin Gin Gin Leu Asp His Leu 
35 85 90 95 

Asp Thr Cys Arg Gly Gin Val Met Gly Glu Glu Asp Ser Glu Leu Gly 
100 105 110 

40 Asn Met Asp Pro lie Val Thr Val Lys Lys Tyr Phe Gin Gly lie Tyr 
115 120 125 

Asp Tyr Leu Gin Glu Lys Gly Tyr Ser Asp Cys Ala Trp Glu lie Val 



42 



130 135 140 

Arg Val Glu Met Met Arg Ala Leu Thr Val Ser Thr Thr Leu Gin Lys 
145 150 155 160 

5 

Arg Leu Thr Lys Met Gly Gly Asp Leu Asn Ser Pro 
165 170 



10 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

! s «* (ii) MOLECULE TYPE: cDNA 

.: :ss. 

% (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: synthetic nucleotide sequence encoding 
^fk a mature human interf eron-tau protein, HuIFNtaul. 



»"Z5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TGTGACTTGT CTCAAAACCA CGTTTTGGTT GGTAGAAAGA ACTTAAGACT ACTAGACGAA 60 

□ ATGAGACGTC TATCTCCACG CTTCTGTCTA CAAGACAGAA AGGACTTCGC TTTGCCTCAG 120 

30 GAAATGGTTG AAGGTGGCCA ACTACAAGAA GCTCAAGCGA TATCTGTTTT GCACGAAATG 18 0 

TTGCAACAAA GCTTCAACTT GTTCCACACC GAACACTCTT CGGCCGCTTG GGACACCACC 24 0 

TTGTTGGAAC AGCTCAGAAC CGGTTTGCAC CAACAATTGG ACAACTTGGA TGCATGTTTG 300 

35 

GGTCAAGTTA TGGGTGAAGA AGACTCTGCT CTCGGGAGAA CCGGTCCAAC GCTAGCTTTG 360 

AAGAGATACT TCCAAGGTAT CCACGTTTAC TTGAAGGAAA AGGGTTACTC TGACTGTGCT 4 20 

4 0 TGGGAAACCG TGCGTCTAGA AATCATGCGT AGCTTCTCTT CTTTGATCAG CTTGCAAGAA 480 

AGATTACGTA TGATGGACGG TGACTTGTCG AGCCCA 516 
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INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: amino acid sequence for a mature 
HuIFNtau protein, HuIFNtaul. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Cys Asp Leu Ser Gin Asn His Val Leu Val Gly Arg Lys Asn Leu Arg 
15 10 15 

Leu Leu Asp Glu Met Arg Arg Leu Ser Pro Arg Phe Cys Leu Gin Asp 
20 25 30 

Arg Lys Asp Phe Ala Leu Pro Gin Glu Met Val Glu Gly Gly Gin Leu 
35 40 45 

Gin Glu Ala Gin Ala He Ser Val Leu His Glu Met Leu Gin Gin Ser 
50 55 60 

Phe Asn Leu Phe His Thr Glu His Ser Ser Ala Ala Trp Asp Thr Thr 
65 70 75 80 

Leu Leu Glu Gin Leu Arg Thr Gly Leu His Gin Gin Leu Asp Asn Leu 
85 90 95 

Asp Ala Cys Leu Gly Gin Val Met Gly Glu Glu Asp Ser Ala Leu Gly 
100 105 110 

Arg Thr Gly Pro Thr Leu Ala Leu Lys Arg Tyr Phe Gin Gly He His 
115 120 125 

Val Tyr Leu Lys Glu Lys Gly Tyr Ser Asp Cys Ala Trp Glu Thr Val 
130 135 140 
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Arg Leu Glu lie Met Arg Ser Phe Ser Ser Leu He Ser Leu Gin Glu 
145 150 155 160 

Arg Leu Arg Met Met Asp Gly Asp Leu Ser Ser Pro 
5 165 170 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

Q 

Q (iv) ANTI-SENSE: NO 

% 

jS: (vi) ORIGINAL SOURCE: 



□ 



(C) INDIVIDUAL ISOLATE: HuIFNtau3, mature no leader sequence 



(ix) FEATURE: 



■$5 (A) NAME /KEY : CDS 

«* (B) LOCATION: 1. .516 

! y 

□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

30 TGT GAC CTG TCT CAG AAC CAC GTG CTG GTT GGC AGC CAG AAC CTC AGG 4 8 

Cys Asp Leu Ser Gin Asn His Val Leu Val Gly Ser Gin Asn Leu Arg 
15 10 15 

CTC CTG GGC CAA ATG AGG AGA CTC TCC CTT CGC TTC TGT CTG CAG GAC 96 
35 Leu Leu Gly Gin Met Arg Arg Leu Ser Leu Arg Phe Cys Leu Gin Asp 
20 25 30 

AGA AAA GAC TTC GCT TTC CCC CAG GAG ATG GTG GAG GGT GGC CAG CTC 14 4 

Arg Lys Asp Phe Ala Phe Pro Gin Glu Met Val Glu Gly Gly Gin Leu 
40 35 40 45 

CAG GAG GCC CAG GCC ATC TCT GTG CTC CAC GAG ATG CTC CAG CAG AGC 192 
Gin Glu Ala Gin Ala He Ser Val Leu His Glu Met Leu Gin Gin Ser 
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50 55 60 

TTC AAC CTC TTC CAC ACA GAG CAC TCC TCT GCT GCC TGG GAC ACC ACC 240 
Phe Asn Leu Phe His Thr Glu His Ser Ser Ala Ala Trp Asp Thr Thr 
5 65 70 75 80 

CTC CTG GAG CAG CTC CGC ACT GGA CTC CAT CAG CAG CTG GAT GAC CTG 288 

Leu Leu Glu Gin Leu Arg Thr Gly Leu His Gin Gin Leu Asp Asp Leu 

85 90 95 

10 

GAT GCC TGC CTG GGG CAG GTG ACG GGA GAG GAA GAC TCT GCC CTG GGA 336 

Asp Ala Cys Leu Gly Gin Val Thr Gly Glu Glu Asp Ser Ala Leu Gly 

100 105 110 

15 AGA ACG GGC CCC ACC CTG GCC ATG AAG AGG TAT TTC CAG GGC ATC CAT 384 
Arg Thr Gly Pro Thr Leu Ala Met Lys Arg Tyr Phe Gin Gly lie His 
115 120 125 

r% 

Q GTC TAC CTG AAA GAG AAG GGA TAT AGT GAC TGC GCC TGG GAA ATT GTC 4 32 

&l) Val Tyr Leu Lys Glu Lys Gly Tyr Ser Asp Cys Ala Trp Glu lie Val 
•Jf 130 135 140 

Q AGA CTG GAA ATC ATG AGA TCC TTG TCT TCA TCA ACC AGC TTG CAC AAA 480 
n Arg Leu Glu lie Met Arg Ser Leu Ser Ser Ser Thr Ser Leu His Lys 
h?5 145 150 155 160 

;T AGG TTA AGA ATG ATG GAT GGA GAC CTG AGC TCA CCT 516 
i;3 Arg Leu Arg Met Met Asp Gly Asp Leu Ser Ser Pro 
M* 165 17 0 

30 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Cys Asp Leu Ser Gin Asn His Val Leu Val Gly Ser Gin Asn Leu Arg 
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10 



15 



Leu Leu Gly Gin Met Arg Arg Leu Ser Leu Arg Phe Cys Leu Gin Asp 
20 25 30 

5 

Arg Lys Asp Phe Ala Phe Pro Gin Glu Met Val Glu Gly Gly Gin Leu 
35 40 45 

Gin Glu Ala Gin Ala He Ser Val Leu His Glu Met Leu Gin Gin Ser 
10 50 55 60 

Phe Asn Leu Phe His Thr Glu His Ser Ser Ala Ala Trp Asp Thr Thr 
65 70 75 80 

15 Leu Leu Glu Gin Leu Arg Thr Gly Leu His Gin Gin Leu Asp Asp Leu 

85 90 95 

n Asp Ala Cys Leu Gly Gin Val Thr Gly Glu Glu Asp Ser Ala Leu Gly 



100 



105 



110 



|;:J Arg Thr Gly Pro Thr Leu Ala Met Lys Arg Tyr Phe Gin Gly He His 
115 120 125 

Val Tyr Leu Lys Glu Lys Gly Tyr Ser Asp Cys Ala Trp Glu He Val 
j|5 130 135 140 

\X Arg Leu Glu He Met Arg Ser Leu Ser Ser Ser Thr Ser Leu His Lys 

ri 145 150 155 160 



30 Arg Leu Arg Met Met Asp Gly Asp Leu Ser Ser Pro 

165 170 
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